Global Optimization of Clusters in Gene Expression Data of DNA Microarrays by Deterministic Annealing
نویسندگان
چکیده
The analysis of DNA microarry data is one of the most important things for functional genomics research. The matrix representation of microarray data and its successive ‘optimal’incisional hyperplanes is a useful platform for developing optimization algorithms to determine the optimal partitioning of pairwise proximity matrix representing completely connected and weighted graph. We developed Deterministic Annealing (DA) approach to determine the successive optimal binary partitioning. DA algorithm demonstrated good performance with the ability to find the ‘globally optimal’binary partitions. In addition, the objects that have not been clustered at small nonzero temperature, are considered to be very sensitive to even small randomness, and can be used to estimate the reliability of the clustering.
منابع مشابه
Modification of the Fast Global K-means Using a Fuzzy Relation with Application in Microarray Data Analysis
Recognizing genes with distinctive expression levels can help in prevention, diagnosis and treatment of the diseases at the genomic level. In this paper, fast Global k-means (fast GKM) is developed for clustering the gene expression datasets. Fast GKM is a significant improvement of the k-means clustering method. It is an incremental clustering method which starts with one cluster. Iteratively ...
متن کاملAnalysis of temporal gene expression profiles: clustering by simulated annealing and determining the optimal number of clusters
MOTIVATION Cluster analysis of genome-wide expression data from DNA microarray hybridization studies has proved to be a useful tool for identifying biologically relevant groupings of genes and samples. In the present paper, we focus on several important issues related to clustering algorithms that have not yet been fully studied. RESULTS We describe a simple and robust algorithm for the clust...
متن کاملEvolution Strategy Applied to Global Optimization of Clusters in Gene Expression Data of DNA Microarrays
Cluster analysis is the most important method for analyzing large-scale gene expression patterns. The matrix representation of microarray data and its successive ‘optimal’ incisional hyperplanes that create topdown hierarchical tree are a useful platform for developing optimization algorithms to determine the ‘optimal’ clusters from a pairwise proximity matrix which represents completely connec...
متن کاملSynthesis and Expression of Modified bFGF Gene in Escherichia coli Cells
A new strategy for construction of synthetic gene encoding human basic fibroblast growth factor comprising DNA annealing-ligation and augmentation by polymerase chain reaction was introduced. The sequence of the gene and corresponding amino acid chain were modified in order to increase stability of the protein. First, 300 bp and 160 bp fragments of the gene were assembled from 18 oligonucleotid...
متن کاملA Hybrid Data Clustering Algorithm Using Modified Krill Herd Algorithm and K-MEANS
Data clustering is the process of partitioning a set of data objects into meaning clusters or groups. Due to the vast usage of clustering algorithms in many fields, a lot of research is still going on to find the best and efficient clustering algorithm. K-means is simple and easy to implement, but it suffers from initialization of cluster center and hence trapped in local optimum. In this paper...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003